4. The Hamiltonian Formalism 



We'll now move onto the next level in the formalism of classical mechanics, due initially 
to Hamilton around 1830. While we won't use Hamilton's approach to solve any further 
complicated problems, we will use it to reveal much more of the structure underlying 
classical dynamics. If you like, it will help us understands what questions we should 
ask. 

4.1 Hamilton's Equations 

Recall that in the Lagrangian formulation, we have the function L{qi,qi,t) where Qi 
{i — 1, . . . ,n) are n generalised coordinates. The equations of motion are 

These are n 2"^^ order differential equations which require 2n initial conditions, say 
qi{t — 0) and qi{t — 0). The basic idea of Hamilton's approach is to try and place 
and Qi on a more symmetric footing. More precisely, we'll work with the n generalised 
momenta that we introduced in section 2.3.3, 

dL , , 

Pi = — ^ = l,...,n 4.2 

so Pi = Pi{qj,qj,t). This coincides with what we usually call momentum only if we 
work in Cartesian coordinates (so the kinetic term is ^rriiqf) . If we rewrite Lagrange's 
equations (4.1) using the definition of the momentum (4.2), they become 

P. = 1^ (4.3) 
dqi 

The plan will be to eliminate qi in favour of the momenta pi, and then to place qi and 
Pi on equal footing. 





Figure 50: Motion in configuration space on the left, and in phase space on the right. 
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Let's start by thinking pictorially. Recall that {qi} defines a point in n-dimensional 
configuration space C . Time evolution is a path in C. However, the state of the system 
is defined by {q^} and {pi} in the sense that this information will allow us to determine 
the state at all times in the future. The pair {qi/Pi} defines a point in 2n-dimensional 
phase space. Note that since a point in phase space is sufficient to determine the future 
evolution of the system, paths in phase space can never cross. We say that evolution 
is governed by a flow in phase space. 

An Example: The Pendulum 

Consider a simple pendulum. The configuration space is clearly a circle, 5^, parame- 
terised by an angle 9 G [— vr, tt). The phase space of the pendulum is a cylinder Rx S^, 
with the R factor corresponding to the momentum. We draw this by fiattening out the 
cylinder. The two different types of motion are clearly visible in the phase space fiows. 
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Figure 51: Flows in the phase space of a pendulum. 

For small 6 and small momentum, the pendulum oscillates back and forth, motion 
which appears as an ellipse in phase space. But for large momentum, the pendulum 
swings all the way around, which appears as lines wrapping around the of phase 
space. Separating these two different motions is the special case where the pendulum 
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starts upright, falls, and just makes it back to the upright position. This curve in phase 
space is called the separatix. 



4.1.1 The Legendre Transform 

We want to find a function on phase space that will determine the unique evolution 
of Qi and Pi. This means it should be a function of qi and pi (and not of qi) but must 
contain the same information as the Lagrangian L(qi,qi,t). There is a mathematical 
trick to do this, known as the Legendre transform. 

To describe this, consider an arbitrary function f{x, y) so that the total derivative is 

df Of 

Now define a function g{x, y, u) — ux — f{x, y) which depends on three variables, x, y 
and also u. If we look at the total derivative of g^, we have 

dg — diux) — df — udx + xdu — -rr-dx — -rr-dy (4.5) 

ox ay 

At this point u is an independent variable. But suppose we choose it to be a specific 
function of x and y, defined by 

df 

^i(2;,y) = ^ (4.6) 

Then the term proportional to dx in (4.5) vanishes and we have 

df 

dg — xdu — —dy (4.7) 
dy 

Or, in other words, g is to be thought of as a function of u and y: g = g{u,y). If we 
want an explicit expression for g{u, y), we must first invert (4.6) to get x — x{u, y) and 
then insert this into the definition of g so that 

g{u,y) = ux{u,y) - f{x{u,y),y) (4.8) 

This is the Legendre transform. It takes us from one function f{x, y) to a different func- 
tion g{u,y) where u = df /dx. The key point is that we haven't lost any information. 
Indeed, we can always recover f{x,y) from g{u,y) by noting that 



dg_ 
du 



^-x(u,y) and ^ 



-v{u,y) (4.9) 



which assures us that the inverse Legendre transform / = —{dg/du)u — g takes us back 
to the original function. 
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The geometrical meaning of the Lcgcndrc transform 
is captured in the diagram. For fixed y, we draw the two 
curves f{x, y) and ux. For each slope the value of g{u) 
is the maximal distance between the two curves. To see 
this, note that extremising this distance means 



^ {ux - f{x)) = 



This picture also tells us that we can only apply the Legen- 
dre transform to convex functions for which this maximum 
exists. Now, armed with this tool, let's return to dynamics 




Figure 52: 



4.1.2 Hamilton's Equations 

The Lagrangian L{qi, qi, t) is a function of the coordinates Qi, their time derivatives Qi 
and (possibly) time. We define the Hamiltonian to be the Legendre transform of the 
Lagrangian with respect to the (ji variables. 



H{qi,Pi,t) = ^PiQi - L{qi,qi,t) 



i=l 



where Qi is eliminated from the right hand side in favour of pi by using 

dL 



Pi 



dqi 



Pi{QjAj:t) 



(4.11) 



(4.12) 



and inverting to get — qi{qj,Pj, t). Now look at the variation of H: 

ITT / 7 • 7 • \ f ^ dL ^. (9L , 

dH = [dpi qi + Pi dqi) - -^dqi + -^dqi + —dt 



dqi 

, . dL ^ dL ^ 
dPi Qi - -K-dqi - -^dt 
oqi at 



dqi 



dt 



but we know that this can be rewritten as 

dH 



OH ^ dH , dH , 
-^dqi + — dpi + ^dt 
dqi dpi dt 



(4.13) 



(4.14) 



So we can equate terms. So far this is repeating the steps of the Legendre transform. 
The new ingredient that we now add is Lagrange's equation which reads Pi = dL/dqi. 
We find 

dH 



Pi 

Qi 

dL 
~dt 



dqi 
dH 

dpi 
dH 

~dt 



(4.15) 
(4.16) 
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These are Hamilton's equations. We have replaced n 2"^^ order differential equations by 
2n order differential equations for qi and pi. In practice, for solving problems, this 
isn't particularly helpful. But, as we shall see, conceptually it's very useful! 

4.1.3 Examples 

1) A Pcirticle in a Potential 

Let's start with a simple example: a particle moving in a potential in 3-dimensional 
space. The Lagrangian is simply 

L = ^mr^ - V{r) (4.17) 

We calculate the momentum by taking the derivative with respect to r 

dL 

p = -pr^ = mr (4.18) 
ar 

which, in this case, coincides with what we usually call momentum. The Hamiltonian 
is then given by 

= p • r - L = — p2 + y(r) (4.19) 
2m 

where, in the end, we've eliminated r in favour of p and written the Hamiltonian as a 
function of p and r. Hamilton's equations are simply 



dE _ 1 
dE 



ap m 



dr 



-VV (4.20) 



which are familiar: the first is the definition of momentum in terms of velocity; the 
second is Newton's equation for this system. 

2) A Pcirticle in an Electromagnetic Field 

We saw in section 2.5.7 that the Lagrangian for a charged particle moving in an elec- 
tromagnetic field is 



1 



L=imP-e(0--r-A) (4.21) 



From this we compute the momentum conjugate to the position 



dL . e ^ /. 
p = ^ = mr+-A (4.22) 
or c 
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which now differs from what we usually call momentum by the addition of the vector 
potential A. Inverting, we have 

r = 1 f p _ - a) (4.23) 

m \ c / 

So we calculate the Hamiltonian to be 



1 / e 

= — P • P 

m 

_ 1 / e .\2 

2m 



H{p,r) = p-r-L 

(p--a)- — (p--A)'-e0+ — (p--A^ 
\ c / [zm \ c / cm \ c J 

(p-^A)' + e0 (4.24) 
Now Hamilton's equations read 

r=§ = l(p-£A) (4.25) 
op m \ c / 

while the p = —dH/dr equation is best expressed in terms of components 

dH dd) e ( e , \ dAu 

Pa = = + — (Ph --AA-^ 4.26 

or a or a cm \ c J or a 

To show that this is equivalent to the Lorentz force law requires some rearranging of 
the indices, but it's not too hard. 

An Example of the Example 

Let's illustrate the dynamics of a particle moving in a magnetic field by looking at a 
particular case. Imagine a uniform magnetic field pointing in the 2;-direction: B = 
(0, 0, B). We can get this from a vector potential B = V x A with 

A=(-Sy,0,0) (4.27) 

This vector potential isn't unique: we could choose others related by a gauge transform 
as described in section 2.5.7. But this one will do for our purposes. Consider a particle 
moving in the (a;, |/)-plane. Then the Hamiltonian for this system is 

Prom which we have four, first order differential equations which are Hamilton's equa- 
tions 
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X 



Py 



1 f 


eB \ 


— 1 Px 


c ^ 


Tfl \ 




eB 1 


eB 




V-r- H 


mc \ 


, c 


Py 




m 





(4.29) 



If we add these together in the right way, we find 



that 



and 



eB 

Py H X — a — const. 



. eB 

Px — mx y = = const. 

c 



(4.30) 
(4.31) 




which is easy to solve: we have 



Figure 53: 



ac 



X — — — + i?sin {uj{t — to)) 
eB 

be ^ . . 
y = — ^ + -Rcos {uj{t - to)) 
ejD 



(4.32) 



with a, b, R and to integration constants. So we see that the particle makes circles in 
the {x, y)-plane with frequency 

eB 



mc 



(4.33) 



This is known as the Larmor frequency. 
4.1.4 Some Conservation Laws 

In Section 2, we saw the importance of conservation laws in solving a given problem. 
The conservation laws are often simple to see in the Hamiltonian formalism. For ex- 
ample, 

Claim: If dH/dt = (i.e. H does not depend on time explicitly) then H itself is 
a constant of motion. 



Proof: 



dH 
'dt 



dH dH dH 

OQi dpi dt 
dH 
dt 

_ dH 
~ 'dt 



(4.34) 
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Claim: If an ignorable coordinate q doesn't appear in the Lagrangian then, by con- 
struction, it also doesn't appear in the Hamiltonian. The conjugate momentum pq is 
then conserved. 



Proof 



(4.35) 



4.1.5 The Principle of Least Action 

Recall that in section 2.1 we saw the principle of least action from the Lagrangian 
perspective. This followed from defining the action 

(4.36) 



I L{qi,qi,t) 
Jti 



dt 



Then we could derive Lagrange's equations by insisting that ^yS" = for all paths with 
fixed end points so that 5qi{ti) — 5qi{t2) — 0. How does this work in the Hamiltonian 
formalism? It's quite simple! We define the action 

S= [\piqi-H)dt (4.37) 

where, of course, = qi{qi,Pi)- Now we consider varying andpj independently. Notice 
that this is different from the Lagrangian set-up, where a variation of qi automatically 
leads to a variation of q^. But remember that the whole point of the Hamiltonian 
formalism is that we treat qi and pi on equal footing. So we vary both. We have 



6S 



t2 



t2 



OH ^ dH ^ 
m qi + PiOqi - ^ dqi 

dpi dqi 



dt 



dH 
dpi 



5pi + 



-Pi 



dH 
dqi 



dqi \dt + \pidq, 



,t2 

hi 



(4.38) 



and there are Hamilton's equations waiting for us in the square brackets. If we look 
for extrema 5S — for all 5pi and 5qi we get Hamilton's equations 

dH , dH 
— and Pi = - — 
dpi dqi 



Qi 



(4.39) 



Except there's a very slight subtlety with the boundary conditions. We need the last 
term in (4.38) to vanish, and so require only that 



Sqiit,) = Sqi{t2) = 



(4.40) 
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while Spi can be free at the end points t = ti and t = t2- So, despite our best efforts, 
qi and pi are not quite symmetric in this formalism. 

Note that wc could simply impose 5pi{ti) — 5pi{t2) = if we really wanted to and 
the above derivation still holds. It would mean we were being more restrictive on the 
types of paths we considered. But it does have the advantage that it keeps qi and pi 
on a symmetric footing. It also means that we have the freedom to add a function to 
consider actions of the form 

S = £{pA-H(,,p)+''-l^) (4.41) 

so that what sits in the integrand differs from the Lagrangian. For some situations this 
may be useful. 

4.1.6 William Rowan Hamilton (1805-1865) 

The formalism described above arose out of Hamilton's interest in the theory of optics. 
The ideas were published in a series of books entitled "Theory of Systems of Rays" , the 
first of which appeared while Hamilton was still an undergraduate at Trinity College, 
Dublin. They also contain the first application of the Hamilton- Jacobi formulation 
(which we shall see in Section 4.7) and the first general statement of the principal of 
least action, which sometimes goes by the name of "Hamilton's Principle" . 

Hamilton's genius was recognised early. His capacity to soak up classical languages 
and to find errors in famous works of mathematics impressed many. In an unprece- 
dented move, he was offered a full professorship in Dublin while still an undergraduate. 
He also held the position of "Royal Astronomer of Ireland", allowing him to live at 
Dunsink Observatory even though he rarely did any observing. Unfortunately, the 
later years of Hamilton's life were not happy ones. The woman he loved married an- 
other and he spent much time depressed, mired in drink, bad poetry and quaternions. 

4.2 Liouville's Theorem 

We've succeeded in rewriting classical dynamics in terms of first order differential equa- 
tions in which each point in phase space follows a unique path under time evolution. 
We speak of a flow on phase space. In this section, we'll look at some of the properties 
of these fiows 

Liouville's Theorem: Consider a region in phase space and watch it evolve over 
time. Then the shape of the region will generically change, but Liouville's theorem 
states that the volume remains the same. 
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Figure 54: An infinitesimal volume element of phase space evolving in time. 



Proof: Let's consider an infinitesimal volume moving for an infinitesimal time. We 
start in a neighbourhood of the point {qi,Pi) in phase space, with volume 

V ^ dqi.. . dqndpi ...dpn (4.42) 

Then in time dt, we know that 

dH 

Qi^ qi + Qidt ^ qi + -7:—dt = Qi (4.43) 

dpi 

and 

OH 

Pi^Pi+ Pidt ^Pi- -^dt = Pi (4.44) 

dqi 

So the new volume in phase space is 

V ^ dqi... dqndpi . . . dpn = (det J) V (4.45) 

where det is the Jacobian of the transformation defined by the determinant of the 
2n X 2n matrix 

j^fdq./dq, dq./dpA 
\ dpi/dqj dpi/dpj J 

To prove the theorem, we need to show that det J" = 1. First consider a single degree 
of freedom (i.e. n — 1). Then we have 

/ 1 + (d^H/dpdq)dt id^H/dp^) dt \ , 
det J = det ^ ' ^ ^' ^ ' ^' \=l + 0{dt^) 4.47 

\ -{d^H/dq^) dt 1 - (d^H/dqdp) dt J 

which means that 

d{det J) 
Jt 



(4.48) 
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so that the volume remains constant for all time. Now to generalise this to arbitrary 
n, we have 



det J = det + id'H/dpidqj)dt {d'H/dp,dpj) dt 

To compute the determinant, we need the result that det(l + eM) — 1 + e TrM + O(e^) 
for any matrix M and small e. Then we have 

and we're done. □ 
4.2.1 Liouville's Equation 

So how should we think about the volume of phase space? We could consider an 
ensemble (or collection) of systems with some density function p{p,q,t). We might 
want to do this because 

• We have a single system but don't know the exact state very well. Then p is 
understood as a probability parameterising our ignorance and 

/ p{q,P,t) n dpidqi = 1 (4.51) 

• We may have a large number N of identical, non-interacting systems (e.g. N — 
10^^ gas molecules in ajar) and we really only care about the averaged behaviour. 
Then the distribution p satisfies 

/ p{q,P,t) lldqidpi = N (4.52) 

In the latter case, wc know that particles in phase space (i.e. dynamical systems) 
are neither created nor destroyed, so the number of particles in a given "comoving" vol- 
ume is conserved. Since Liouville tells us that the volume elements dpdq are preserved, 
we have dp/dt = 0. We write this as 

dp^dp^d^.^^d^^^ 
dt dt dqi * dpi 

^ dp ^ dp_dl^ _ dp_dl^ ^ ^ 
dt dqi dpi dpi dqi 
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Rearranging the terms, we have, 

dp dp dH dp dH 
dt dpi dqi dqi dpi 

which is Liouville's equation. 

Notice that Liouville's theorem holds whether or not the system conserves energy, 
(i.e. whether or not dH/dt = 0). But the system must be described by a Hamiltonian. 
For example, systems with dissipation typically head to regions of phase space with 
qi — and so do not preserve phase space volume. 

The central idea of Liouville's theorem - that volume of phase space is constant - 
is somewhat reminiscent of quantum mechanics. Indeed, this is the first of several oc- 
casions where we shall see ideas of quantum physics creeping into the classical world. 
Suppose we have a system of particles distributed randomly within a square AqAp in 
phase space. Liouville's theorem implies that if we evolve the system in any Hamil- 
tonian manner, we can cut down the spread of positions of the particles only at the 
cost of increasing the spread of momentum. We're reminded strongly of Heisenberg's 
uncertainty relation, which is also written AqAp — constant. 

While Liouville and Heisenberg seem to be talking the same language, there are very 
profound differences between them. The distribution in the classical picture reflects 
our ignorance of the system rather than any intrinsic uncertainty. This is perhaps best 
illustrated by the fact that we can evade Liouville's theorem in a real system! The 
crucial point is that a system of classical particles is really described by collection of 
points in phase space rather than a continuous distribution p{q,p) as we modelled it 
above. This means that if we're clever we can evolve the system with a Hamiltonian 
so that the points get closer together, while the spaces between the points get pushed 
away. A method for achieving this is known as stochastic cooling and is an important 
part of particle collider technology. In 1984 van der Meer won the the Nobel prize for 
pionering this method. 

4.2.2 Time Independent Distributions 

Often in physics we're interested in probability distributions that don't change explicitly 
in time (i.e. dp/dt — 0). There's an important class of these of the form, 

p^p{H{q,p)) (4.55) 

To see that these are indeed time independent, look at 

dp dp dH dp dH 
dt dpi dqi dqi dpi 
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(4.54) 



dp OH OH dp dH dH 
dH dpi dqi dH dqi dpi 



= 



(4.56) 



A very famous example of this type is the Boltzmann distribution 



exp 



( 



kT 



(4.57) 



for systems at a temperature T. Here k is the Boltzmann constant. 

For example, for a free particle with H = p^/2m, the Boltzmann distribution is 
p = exp(— mr^/2A;T) which is a Gaussian distribution in velocities. 

An historically more interesting example comes from looking at a free particle in a 
magnetic field, so H = (jp — eA)^/2m. (where we've set the speed of light c = 1 for 
simplicity). Then the Boltzmann distribution is 



which is again a Gaussian distribution of velocities. In other words, the distribution 

in velocities is independent of the magnetic field. But this is odd: the magnetism of 
solids is all about how the motion is electrons is affected by magnetic fields. Yet we've 
seen that the magnetic field doesn't affect the velocities of electrons. This is known as 
the Bohr-van Leeuwen paradox: there can be no magnetism in classical physics! This 
was one of the motivations for the development of quantum theory. 

4.2.3 Poinccire Recurrence Theorem 

We now turn to work of Poincare from around 1890. The following theorem apphes to 
systems with a bounded phase space (i.e. of finite volume). This is not an uncommon 
occurrence. For example, if we have a conserved energy E = T + V with T > and 
V > then the accessible phase space is bounded by the spatial region V{r) < E. 
With this in mind, we have 




(4.58) 




Figure 55: The Hamiltonian map in a time step T. 
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Theorem: Consider an initial point P in phase space. Then for any neighbourhood 
Dq of P, there exists a point P' e Dq that wiU return to Dq in a finite time. 




Figure 56: 



(4.59) 



Proof: Consider the evolution of Dq over a finite time 
interval T. Hamilton's equations provide a map Dq i— > Di 
shown in figure 55. By Liouville's theorem, wc know that 
Vol{Do) = Vol{Di), although the shapes of these two regions 
will in general be different. Let Dk be the region after time kT 
where k is an integer. Then there must exist integers k and k' 
such that the intersection of and 0^/ is not empty: 

(If this isn't true then the total volume IJfc!Lo ~^ ^ hut, 
by assumption, the phase space volume is finite). Take k' > k 
such that uJk,k' = Dk H Dk' 7^ 0. But since the Hamiltonian 
mapping Dk -Dfc+i is invertible, we can track backwards to 
find oJo^k'-k = Dq n Dk'-k 7^ 0. So some point P' e Dq has 
returned to D in k' — k time steps T. □ 

What does the Poincare recurrence theorem mean? Consider 
gas molecules all in one corner of the room. If we let them go, 
they fill the room. But this theorem tells us that if we wait long enough, they will all 
return once more to the corner of the room. The trick is that the Poincare recurrence 
time for this to happen can easily be longer than the lifetime of the universe! 




Figure 57: 



Figure 58: Eventually all the air molecules in a room will return to one corner. 



Question: Where's your second law of thermodynamics now?! 
4.3 Poisson Brackets 

In this section, we'll present a rather formal, algebraic description of classical dynamics 
which makes it look almost identical to quantum mechanics! We'll return to this 
analogy later in the course. 
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We start with a definition. Let f{q,p) and g{q,p) be two functions on pfiase space. 
Ttien tfie Poisson bracket is defined to be 

{/,S} = |/|L-|/|L (4.60) 

dQi dpi dpi dqi 

Since tliis is a kind of weird definition, let's look at some of the properties of the Poisson 
bracket to get a feel for it. We have 

• = 

• linearity: {af + Pg, h} = a{f, h} + I3{g, h} for all a, /5 e R. 

• Leibniz rule: {fg, h} — f{g, h} + {/, h}g which follows from the chain rule in 
differentiation. 

• Jacobi identity: {/, {g, h}} + {g, {h, /}} + {h, {/, g}} = 0. To prove this you 
need a large piece of paper and a hot cup of coffee. Expand out all 24 terms and 
watch them cancel one by one. 

What we've seen above is that the Poisson bracket { , } satisfies the same algebraic 
structure as matrix commutators [ , ] and the differentiation operator d. This is related 
to Heisenberg's and Schrodinger's viewpoints of quantum mechanics respectively. (You 
may be confused about what the Jacobi identity means for the derivative operator d. 
Strictly speaking, the Poisson bracket is hke a "Lie derivative" found in differential 
geometry, for which there is a corresponding Jacobi identity) . 

The relationship to quantum mechanics is emphasised even more if we calculate 

{g„gj} = 

{p^,Pj} = (4.61) 

{qi,Pj} = 

We'll return to this in section 4.8. 



Claim: For any function f{q,p,t), 



Proof: 



dt dpi * dqi * dt 
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- dl£ dim dl 
dpi dqi dqi dpi dt 

Isn't this a lovely equation! One consequence is that if we can find a function /(p, q) 
which satisfy 

{/, H}^0 (4.64) 

then / is a constant of motion. We say that / and H Poisson commute. As an example 
of this, suppose that qi is ignorable (i.e. it does not appear in H) then 

{Pi,H}^Q (4.65) 

which is the way to see the relationship between ignorable coordinates and conserved 
quantities in the Poisson bracket language. 

Note that if / and J are constants of motion then {{/, J}, H} — {/, {J, H}} + 
{{/, if}, J} = which means that {/, J} is also a constant of motion. We say that the 
constants of motion form a closed algebra under the Poisson bracket. 

4.3.1 An Example: Angular Momentum and Runge-Lenz 

Consider the angular momentum L = r x p which, in component form, reads 

Li = r2P3 - rzP2 , L2 = rapi - ripa , L3 = rip2 - r2Pi (4.66) 

and let's look at the Poisson bracket structure. We have 

{Li, L2] = {r2j53 - r3P2, r^ipi - rip:^] 
= {r2Pz,rzPi} + {r3P2,riP3} 

= -r2Pi + P2ri = (4.67) 

So if Li and L2 are conserved, we see that L3 must also be conserved. Or, in other 
words, the whole vector L is conserved if any two components are. Similarly, one can 
show that 

{L\Li}^Q (4.68) 
where = Y^iLf. This should all be looking famihar from quantum mechanics. 
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Another interesting object is the (Hermann-BernouUi-Laplace-Pauh-) Runge-Lenz 
vector, defined as 

A = — pxL-f (4.69) 

m 

where r = r/r. This vector satisfies A ■ L = 0. If you're wilhng to spend some time 
playing with indices, it's not hard to derive the following expressions for the Poisson 
bracket structure 

{La, Ab) = eabcAc , {Aa, A} = -— - -\ SabcLc (4.70) 

The last of these equations suggests something special might happen when we consider 
the familiar Hamiltonian H = p^/2m — 1/r so that the Poisson bracket becomes 

[Aa, Ab} = eabcLc (4.71) 

m 

Indeed, for this choice of Hamiltonian is a rather simple to show that 

{H, A} = (4.72) 

So we learn that the Hamiltonian with —1/r potential has another constant of motion A 
that we'd previously missed! The fact that A is conserved can be used to immediately 
derive Kepler's elliptical orbits: dotting A with r yields r ■ A + 1 = L^/r which is the 
equation for an ellipse. Note that the three constants of motion, L, A and H form a 
closed algebra under the Poisson bracket. 

Noether's theorem tells us that the conservation of L and H are related to rotational 
symmetry and time translation respectively. One might wonder whether there's a 
similar symmetry responsible for the conservation of A. It turns out that there is: the 
Hamiltonian has a hidden 5*0(4) symmetry group. You can read more about this in 
Goldstein. 

4.3.2 An Example: Magnetic Monopoles 

We've seen in the example of section 4.1.3 that a particle in a magnetic field B = V x A 
is described by the Hamiltonian 

where, as usual in the Hamiltonian, r is to be thought of as a function of r and p. It's 
a simple matter to compute the Poisson bracket structure for this system: it reads 

g 

{mr-a, mfb) = - eabcBc , {mfa, n) = -5ab (4.74) 
c 



- 96 - 



Let's now use this to describe a postulated object known as a magnetic monopole. It's 
a fact that all magnets ever discovered are dipoles: they have both a north and south 
pole. Chop the magnet in two, and each piece also has a north and a south pole. 
Indeed, this fact is woven into the very heart of electromagnetism when formulated in 
terms of the gauge potential A. Since we define B = V x A, we immediately have one 
of Maxwell's equations, 

V • B - (4.75) 

which states that any flux that enters a region must also leave. Or, in other words, 
there can be no magnetic monopole. Such a monopole would have a radial magnetic 
held, 

B = ^ ^ (4.76) 

which doesn't satisfy (4.75) since it gives rise to a delta function on the right-hand side. 
So if magnetic monopoles have never been observed, and are forbidden by Maxwell's 
equations, why are we interested in them?! The point is that every theory that goes be- 
yond Maxwell's equations and tries to unify electromagnetism with the other the forces 
of Nature predicts magnetic monopoles. So there's reason to suspect that, somewhere 
in the universe, there may be particles with a radial magnetic field given by (4.76). 

What happens if an electron moves in the background of a monopole? It's tricky 
to set up the Lagrangian as we don't have a gauge potential A. (Actually, one can 
work with certain singular gauge potentials but we won't go there). However, we can 
play with the Poisson brackets (4.74) which contain only the magnetic field. As an 
application, consider the generalised angular momentum, 

J = mrxr-— r (4.77) 

where r = r/r. For g — 0, this expression reduces to the usual angular momentum. It is 
a simple matter to show using (4.74) that in the background of the magnetic monopole 
the Hamiltonian H = ^mr"^ and J satisfy 

{H, J} = (4.78) 

which guarantees that J is a constant of motion. What do we learn from this? Since 
J is conserved, we can look at r ■ J = —eg/c to learn that the motion of an electron in 
the background of a magnetic monopole lies on a cone of angle cos^ = eg/cJ pointing 
away from the vector J. 
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4.3.3 An Example: The Motion of Vortices 

The formal structure of Poisson brackets that we've introduced here can be employed 
even when it's not obvious that we're talking about coordinates and momenta. To 
illustrate this, consider the rather odd motion of line vortices moving in a plane. For n 
vortices with positions rj = {xi, yi), each with strength ji, the equations of motion are 



^^^^ Ir- - r -p 

liVi = + z^iaj] ^ 

^ ■'ri-r^p 

where there is no sum over i on the left hand side of 
these equations. Notice that these are first order equa- 
tions for the position variables, rather than for position 
and momentum. How can we cast this dynamics in a 
Hamiltonian framework? The trick is to consider one of 
the positions as a "canonical momentum". We consider 
the Hamiltonian 



(4.79) 





i<j 



(4.80) 



Figure 59: 



which is endowed with the Poisson bracket structure 



^ -fi \dxi dyi dyi dxi) 



In particular, we have 



(4.81) 



(4.82) 



Using this, we can check that Hamilton's equations reproduce the equations of motion. 



Vi = {Vi, — 

7i dxi 



(4.83) 



The system has a number of conserved quantities. Firstly, there is the "total momen- 
tum" , 



(4.84) 
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which satisfy {Px, H} = {Py, H} = 0, ensuring that they are conserved quantities. 
We also have {Px,Py} = X^j7i and the right hand side, being constant, is trivially 
conserved. 

The other conserved quantity is the "total angular momentum" , 

n 

1=1 

which again satisfies {J,H} = 0, ensuring it is conserved. The full algebra of the 
conserved quantities includes {P^, J} — —Py and {Py, J} — P^, so the system closes 
(meaning we get back something we know on the right hand side). In fact, one can 
show that if, J and (Pj + Py) provide three mutually Poisson commuting conserved 
quantities. 

So what is the resulting motion of a bunch of vortices? For two vortices, we can 
simply solve the equations of motion to find, 

Xi-X2 = -Rsin (-^(^ - ^o)) 

yi-y2 = i?cos(-^(i-io)) (4.86) 

where it! is the separation between the vortices and uj — {^i + ^2) / Fl? ■ So we learn 
that two vortices orbit each other with frequency inversely proportional to the square 
of their separation. 

For three vortices, it turns out that there is a known solution which is possible 
because of the three mutually Poisson commuting conserved quantities we saw above. 
We say the system is "integrable" . We'll define this properly shortly. For four or more 
vortices, the motion is chaotic^. 

You may think that the Poisson bracket structure {x, |/} 7^ looks a little strange. 
But it also appears in a more familiar setting: a charged particle moving in a magnetic 
field B = (0, 0, S). We saw this example in section 4.1.3, where we calculated 

eB 

Px = mx y (4.87) 

mc 

For large magnetic fields the second term in this equation dominates, and we have 
Pj. f» —eBy/mc. In this case the Poisson bracket is 

Tnc 

{x,p^} = l ^ {x,y}^ (4.88) 



^For more details on this system, see the review H. Aref, '^Integrable, Chaotic, and Turbulent Vortex 
Motion in Two-Dimensional Flows", Ann. Rev. Fluid Mech. 15 345 (1983). 
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This algebraic similarity between vortices and electrons is a hot topic of current re- 
search: can we make vortices do similar things to electrons in magnetic fields? For 
example: will vortices in a Bose-Einstein condensate form a fractional quantum Hall 
state? This is currently an active area of research. 

4.4 Canonical Transformations 

There is a way to write Hamilton's equations so that they look even more symmetric. 
Define the 2n vector x = {qi, . . . ,qn,pi, . . . ,Pn)^ ^-nd the 2n x 2n matrix J, 



.J (4-89) 




where each entry is itself an nxn matrix. Then with this notation, Hamilton's equations 
read 

x=7^ (4.90) 

Now remember that in the Lagrangian formalism we made a big deal about the fact that 
we could change coordinates Qi — > Qi{q) without changing the form of the equations. 
Since we've managed to put Qi and pi on an equal footing in the Hamiltonian formalism, 
one might wonder if its possible to make an even larger class of transformations of the 
form, 

qi^Qiiq,p) and Pi^Piiq,p) (4.91) 

The answer is yes! But not all such transformations are allowed. To see what class of 
transformations leaves Hamilton's equations invariant, we use our new symmetric form 
in terms of x and write the transformation as 

Xi ^ Viix) (4.92) 

Note that we'll continue to use the index i which now runs over the range i — 1, . . . , 2n. 
We have 

■. = ^x=—J —— (4 93) 

* dxj ^ dxj ^'^ dyi dxk 

or, collating all the indices, we have 

y^iJJJ^-^ (4.94) 
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where Jij = dyi/dxj is the Jacobian that we met in section 4.2. We see that Hamilton's 
equations are left invariant under any transformation whose Jacobian J' satisfies 

jjjT^j ^ = (4.95) 

The Jacobian J' is said to be symplectic if this holds. A change of variables with a 
symplectic Jacobian is said to be a canonical transformation. 

There is a nice method to construct canonical transformations using "generating 
functions" which wc will mention in section 4.4.3. Before we get to this, let's look 
at some uses. We start by proving a theorem relating canonical transformations with 
Poisson brackets. 



Theorem: The Poisson bracket is invariant under canonical transformations. Con- 
versely, any transformation which preserves the Poisson bracket structure so that 

{Qi,Qj} = {Pi,Pj} = and {Qi,Pj} = 5ij (4.96) 

is canonical. 



Proof: Let's start by showing that the Poisson bracket is invariant under canonical 
transformations. Consider two functions /(xj) and g{xi). Then, 

If 1 ^ df dg df dg 
dqi dpi dpi dqi 

= ^Jr^ (4.97) 
dxi ''^ dxj 

So if X ^ y{x), we have 

|^ = |^J« (4.98) 

dxi dyk 

and, assuming the transformation is canonical, the Poisson bracket becomes 

{/, 9} = ^ Ju J^, Jij ^-^Jki^ (4.99) 
dyk dyi dyk dyi 

This means that we can compute our Poisson brackets in any coordinates related by a 
canonical transformation. Now let's show the converse. Go back to the notation {qi,Pi) 
and the new coordinates {Qi{q,p), Pi{q,p))- The Jacobian is given by 

fdQ./dq, dQ^/dpA 
\dPjdq, dP,/dp, 
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If we now compute JJJ^ in components, we get 



So whenever the Poisson bracket structure is preserved, the transformation is canonical. 

□ 

Example 

In the next section we'll see several non-trivial examples of canonical transformations 
which mix up q and p variables. But for now let's content ourselves with reproducing 
the coordinate changes that we had in section 2. Consider a change of coordinates of 
the form 

qi^QM) (4.102) 

We know that Lagrange's equations are invariant under this. But what transformation 
do we have to make on the momenta 

Pi^Pi{q,p) (4.103) 

so that Hamilton's equations are also invariant? We write Qij — dQi/dqj and look at 
the Jacobian 



dPi/dqj dPi/dp 



in order for the transformation to be canonical, we require J J — J. By expanding 
these matrices out in components, we see that this is true if 

P, = {Q-^)jiPj (4.105) 

This is as we would expect, for it's equivalent to Pj = dL/dQi. Note that although 
Qi — Qiiq) only. Pi 7^ Pi{p)- Instead, the new momentum Pj depends on both q and p. 

4.4.1 Infinitesimal Canonical Transformations 

Consider transformations of the form 

qi ^ Qi = qi + aFi{q,p) 

Pi^ Pi=Pi + aEi{q,p) (4.106) 
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where a is considered to be infinitesimally small. What functions Fi{q,p) and Ei{q,p) 
are allowed for this to be a canonical transformation? The Jacobian is 

f Sij + adFi/dqj adFi/dpj ^^^^^^ 
^ \ a dEi I dqj 5ij + a dEi / dpj 

so the requirement that J JJ^ — J gives us 

dF^^_dE^ 
dqj dpj 

which is true if 



(4.108) 



dC dC 
Fi^— and Ei^-— (4.109) 
dpi dqi 

for some function G{q,p). We say that G generates the transformation. 

This discussion motivates a slightly different way of thinking about canonical trans- 
formations. Suppose that we have a one-parameter family of transformations, 

Qi^ Qi{q:P;a) and Pi ^ Pi{q,p; a) (4.110) 

which are canonical for all o; e R and have the property that Qi{q,p; a — 0) — qi and 
Pi{q,p; a — 0) — Pi- Up until now, we've been thinking of canonical transformations in 
the "passive" sense, with the {Qi, Pi) labelling the same point in phase space as {qi,Pi), 
just in different coordinates. But a one-parameter family of canonical transformations 
can be endowed with a different interpretation, namely that the transformations take 
us from one point in the phase space {qi,Pi) to another point in the same phase space 
{Qi{q,p;a), Pi{q,p;a)). In this "active" interpretation, as we vary the parameter a 
we trace out hues in phase space. Using the results (4.106) and (4.109), the tangent 
vectors to these lines are given by, 

^ ^ U III) 

da dpi da dqi 

But these look just like Hamilton's equations, with the Hamiltonian replaced by the 
function G and time replaced by the parameter a. What we've found is that every 
one-parameter family of canonical transformations can be thought of as "Hamiltonian 
flow" on phase space for an appropriately chosen "Hamiltonian" G. Conversely, time 
evolution can be thought of as a canonical transformation for the coordinates 

iqiito),Piito)) ^ iqiit),Piit)) (4.112) 

generated by the Hamiltonian. Once again, we see the link between time and the 
Hamiltonian. 
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As an example, consider the function G = pk- Then the corresponding infinitesimal 
canonical transformation is Qi ^ Qi + a6ik and pi ^ Pi, which is simply a translation. 
We say that translations of Qk are generated by the conjugate momentum G = p^. 

4.4.2 Noether's Theorem Revisited 

Recall that in the Lagrangian formalism, we saw a connection between symmetries and 
conservation laws. How does this work in the Hamiltonian formulation? 

Consider an infinitesimal canonical transformation generated by G. Then 

dH ^ dH ^ 
oH ^ —dqi + — dpi 

dH dG _ dH dG ^ 
dqi dpi dpi dqi 
^a{H,G} (4.113) 

The generator G is called a symmetry of the Hamiltonian if SH — 0. This holds if 

{G,H} = (4.114) 

But wc know from section 4.3 that G = {G. H}. Wc have found that if G is a symmetry 
then G is conserved. Moreover, wc can reverse the argument. If we have a conserved 
quantity G, then we can always use this to generate a canonical transformation which 
is a symmetry. 

4.4.3 Generating PXinctions 

There's a simple method to construct canonical transformations between coordinates 
{qi,Pi) and {Qi, Pi). Consider a function F{q, Q) of the original g^'s and the final Qi's. 
Let 

dF 

dqi 

After inverting, this equation can be thought of as defining the new coordinate Q; ~ 
Qiil^p)- But what is the new canonical momentum P? We'll show that it's given by 

dF 



Pi 



(4.115) 



^Q^ 



(4.116) 



The proof of this is a simple matter of playing with partial derivatives. Let's see how 
it works in an example with just a single degree of freedom. (It generalises trivially to 
the case of several degrees of freedom) . We can look at the Poisson bracket 



{Q,P} 



dQ 


dP 




dQ 


dP 


dq 




5 


dp 


q^ V 



(4.117) 
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At this point we need to do the playing with partial derivatives. Equation (4.116) 
defines P = P{q, Q), so we have 



(4.118) 



dP 




dQ 


dP 


dp 


1 


dp 


dQ 



dP 




dP 


+ 

Q 


dQ 


dP 


dq 


p 


dq 


dq 





Inserting this into the Poisson bracket gives 

{Q, P}-- 



dQ 


dP 


_ dQ 


d^F 


dQ 


dp 


dp 


q 9q 


Q 


g dqdQ 


dp 


dQ 

q ^ q 



(4.119) 



as required. The function F{q, Q) is known as a generating function of the first kind. 

There are three further types of generating function, related to the first by Leg- 
endre transforms. Each is a function of one of the original coordinates and one of 
the new coordinates. You can check that the following expression all define canonical 
transformations: 



F2{q,P): 
F3{p,Q): 
F4p,P): 



Pi 
Qi 
Qi 



dqi 
_dF\ 

dpi 
_dF\ 

dpi 



and Qi = 
and Pi = 
and Qi — 



dP^ 

dFs 

dQ^ 
dF\ 

dp 



(4.120) 



4.5 Action-Angle Vciriables 

We've all tried to solve problems in physics using the wrong coordinates and seen 
what a mess it can be. If you work in Cartesian coordinates when the problem really 
requires, say, spherical polar coordinates, it's always possible to get to the right answer 
with enough perseverance, but you're really making life hard for yourself. The ability 
to change coordinate systems can drastically simplify a problem. Now we have a much 
larger set of transformations at hand; we can mix up g's and p's. An obvious question 
is: Is this useful for anything?! In other words, is there a natural choice of variables 
which makes solving a given problem much easier. In many cases, there is. They're 
called "angle- action" variables. 

4.5.1 The Simple Harmonic Oscillator 

We'll start this section by doing a simple example which will illustrate the main point. 
We'll then move on to the more general theory. The example we choose is the simple 
harmonic oscillator. Notice that as our theory gets more abstract, our examples get 
easier! 
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We have the Hamiltonian 

H = ^ + -mwY (4.121) 
2m 2 ^ ^ ' 

so that Hamilton's equations are the familiar 

p = —muPq and q— — (4.122) 

m 

which has the rather simple solution Figure 60: 

q — Acos{ijj{t — tQ)) and p — —mujAs\n{uj{t — t^)) (4.123) 

where A and to are integration constants. The flows in phase space are ellipses as shown 
in the figure. 

Now let's do a rather strange change of variables in which we use our freedom to mix 
up the position and momentum variables. We write 

{q,p)^{ej) (4.124) 

where you can think of 6 is our new position coordinate and / our new momentum 
coordinate. The transformation we choose is: 



q — a/ sin^ and p — \JlImw cos9 (4.125) 
V mw 

It's an odd choice, but it has advantages! Before we turn to these, let's spend a minute 
checking that this is indeed a canonical transformation. There's two ways to do this 
and we'll do both: 

1) We can make sure that the Poisson brackets are preserved. In fact, it's easier to 
work backwards and check that {q,p} = 1 in {9,1) coordinates. In other words, we 
need to show that 

_dq dp dq dp 
{^,pW) = ^^-57 5^ = 1 (4-126) 

To confirm this, let's substitute the transformation (4.125), 




{?)P}(6»,/) = { \l sin 6, V 2Imw cos 9 



(0,1) 



= 2|V7sine, V/cos^l =1 (4.127) 

where the final equality follows after a quick differentiation. So we see that the trans- 
formation (4.125) is indeed canonical. 
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2) The second way to see that the transformation is canonical is to prove that the 
Jacobian is symplectic. Let's now check it this way. We can calculate 



(d9/dq d9/dp\ / (muj/p) cos^ 9 — (mujq/p) cos^ 9 
dl/dq dl /dp J \ mujq p/muj 

from which we can calculate J J and find that it is equal to J as required. 



(4.128) 



So we have a canonical transformation in (4.125). But what's the point of doing 
this? Let's look at the Hamiltonian in our new variables. 



E 



1 12/ 

- — (2mujl) sin^ 9 + - muJ^ cos^ 9 

2m 2 mw 



ujI 



(4.129) 



9 







(4.130) 



so the Hamiltonian doesn't depend on the variable 9\ This means 
that Hamilton's equations read 

OH ^ ■ dH 

We've managed to map the phase space flow onto a cylinder param- 
etcrised by 9 and / so that the flows arc now all straight lines as 
shown in the figure. The coordinates {9, 1) are examples of angle- 
action variables. 



Figure 61: 



4.5.2 Integrable Systems 

In the above example, we saw that we could straighten out the fiow lines of the simple 
harmonic oscillator with a change of variables, so that the motion in phase space became 
trivial. It's interesting to ask if we can we do this generally? The answer is: only for 
certain systems that are known as integrable. 

Suppose we have n degrees of freedom. We would like to find canonical transforma- 
tions 

{qi,Pi)^{9i,Ii) (4.131) 

such that the Hamiltonian becomes H — H{Ii, . . . ,In) and doesn't depend on 9i. If we 
can do this, then Hamilton's equations tell us that we have n conserved quantities /j, 
while 

dH 

where iVi is independent of 9 (but in general depends on J) so that the solutions are 
simply 9i = uoit. Whenever such a transformation exists, the system is said to be 
integrable. For bounded motion, the 9i are usually scaled so that < 6'j < 27r and the 
coordinates (6'^, Jj) are called angle-action variables. 



9i 



(4.132) 
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Liouville's Theorem on Integrable Systems: There is a converse statement. 
If we can find n mutually Poisson commuting constants of motion Ji, . . . , J„ then this 
implies the existence of angle-action variables and the system is integrable. The re- 
quirement of Poisson commutation {/j, /j} = is the statement that we can view the 
li as canonical momentum variables. This is known as Liouville's theorem. (Same 
Liouville, different theorem). A proof can be found in the book by Arnold. 

Don't be fooled into thinking all systems are integrable. They are rather special and 
precious. It remains an active area of research to find and study these systems. But 
many - by far the majority - of systems are not integrable (chaotic systems notably 
among them) and don't admit this change of variables. Note that the question of 
whether angle-action variables exist is a global one. Locally you can always straighten 
out the flow fines; it's a question of whether you can tie these straight fines together 
globally without them getting tangled. 

Clearly the motion of a completely integrable system is restricted to lie on Jj = 
constant slices of tfie pfiasc space. A tfieorem in topology says tfiat tfiese surfaces must 
be tori {S^ x . . . x S^) known as tfie invariant tori 



4.5.3 Action-Angle Variables for Id Systems 

Let's see fiow tfiis works for a Id system witfi Hamil- 
tonian 



(4.133) 




Figure 62: 



Since H itself is a constant of motion, witfi H — E for 

some constant E tfirougfiout tfie motion, tfie system is 
integrable. We assume tfiat tfie motion is bounded so tfiat 
gi < g < g2 as sfiown in tfie figure. Tfien tfie motion is 

periodic, oscillating back and fortfi between tfie two end points, and tfie motion in pfiase 
space looks sometfiing like tfie figure 63. Our goal is to find a canonical transformation 
to variables 9 and / tfiat straigfitens out tfiis flow to look fike tfie second figure in tfie 
diagram. 



So wfiat are I and 97 Since / is a constant of motion, it sfiould be some function of 
tfie energy or, alternatively. 



H = H{I) = E 



(4.134) 
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Figure 63: Can we straighten out the flow Unes in phase space? 



But which choice will have as its canonical partner 9 e [0, 27r) satisfying 
for a constant uj which is the frequency of the orbit? 



Claim: The correct choice for I is 




(4.136) 



which is the area of phase space enclosed by an orbit (divided by 27r) and is a function 
of the energy only. 



Proof: Since the Hamiltonian is conserved, we may write the momentum as a function 
of q and E: 



We know that for this system p = mq so we have 

Im dq 
V 2 - V{q) 

Integrating over a single orbit with period T — 2'k/u!, we have 



^ V 2 / ^E - V{q) 

= / V2^(J^VE-V{q) 



dq 



(4.137) 



(4.138) 



(4.139) 
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At this point we take the differentiation d/dE outside the integral This isn't obviously 
vahd since the path around which the integral is evaluated itself changes with energy 
E. Shortly we'll show that this doesn't matter. For now, let's assume that this is valid 
and continue to find 



27r 



_d_ 

dE 
d 

dE 
27r' 



2m^/E - V{q) dq 



pdq 



dl_ 

dE 



(4.140) 



UJ 



(4.141) 



where in the last line, we've substituted for our putative action variable /. Examining 
our end result, we have found that / does indeed satisfy 

dE 
If 

where uj is the frequency of the orbit. This is our re- 
quired result, but it remains to show that we didn't 
miss anything by taking d/dE outside the integral. 
Let's think about this. We want to see how the area 
enclosed by the curve changes under a small shift in 
energy 6E. Both the curve itself and the end points 
gi < g < g2 vary as the energy shifts. The latter 
change by 6qi = {dV {qi) / dq) 5 E . Allowing the dif- 
ferential d/dE to wander inside and outside the in- 
tegral is tantamount to neglecting the change in the 
end points. The piece we've missed is the small white 
region in the figure. But these pieces are of order SE"^. 
piece are given by 

^^E - V{q) ^ V2^VE - V{q) —6E 




Figure 64: 

To see this, note that order 6E 



f 

J Hi- 



dV 



(4.142) 



evaluated at the end point q 
This completes the proof. 



qi. They vanish because E = V{qi 



at the end points. 

□ 



This tells us that we can calculate the period of the orbit u by figuring out the area 
enclosed by the orbit in phase space as a function of the energy. Notice that we can 
do this without ever having to work out the angle variable 6 (which is a complicated 
function of q and p) which travels with constant speed around the orbit (i.e. satisfies 
e = ujt). 
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In fact, it's not too hard to get an expression for 9 by going over the above analysis 
for a small part of the period. It follows from the above proof that 



t 



_d_ 
dE 



pdq 



(4.143) 



but we want a 9 which obeys 9 — cut. We see that we can achieve this by taking the 
choice 

Because E is conserved, all Id systems are integrable. What about higher dimen- 
sional systems? If they are integrable, then there exists a change to angle-action vari- 
ables given by 



J2 Pj dQj 



9i = 



d 



(4.145) 



where the ji are the periods of the invariant tori. 

4.6 Adiabatic Invariants 

Consider a Id system with a potential V{q) that de- 
pends on some paramter A. If the motion is bounded by 
the potential then it is necessarily periodic. We want to 
ask what happens if we slowly change A over time. For 
example, we may slowly change the length of a pendulum, 
or the frequency of the harmonic oscillator. 

Since we now have A = X{t), the energy is not conserved. 
Rather E = E{t) where 



E W 


V(q) 









q 



E 



dH ■ 



Figure 65: 



(4.146) 



But there are combinations of E and A which remain (approximately) constant. These 
are called adiabatic invariants and the purpose of this section is to find them. In fact, 
we've already come across them: we'll see that the adiabatic invariants are the action 
variables of the previous section. 
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For the Id system, the Hamiltonian is 



and we claim that the adiabatic invariant is 

1 



7 = 



27r 



pdq 



(4.147) 



(4.148) 



where the path in phase space over which we integrate now depends on time and is 
given by p = 



J2m\/ E{t) — A(t)). The purpose of this section is to show that / 
is indeed an adiabatic invariant. At the same time we will also make clearer what we 
mean when we say that A must change slowly. 

Let's start by thinking of / as a function of the energy E and the parameter A. As 
we vary either of these, / will change. We have. 



7=^ 

dE 



A 



(4.149) 



E 



where the subscripts on the partial derivatives tell us what variable we're keeping fixed. 
For an arbitrary variation of E and A, this equation tells us that / also changes. But, 
of course, E and A do not change arbitrarily: they are related by (4.146). The point of 
the adiabatic invariant is that when E and A are related in this way, the two terms in 
(4.149) approximately cancel out. We can deal with each of these terms in turn. The 
first term is something we've seen previously in equation (4.141) which tells us that. 
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where T{X) is the period of the system evaluated at fixed A. 
The second term in (4.149) tells us how the path changes 
as A is varied. For example, two possible paths for two 
different A's are shown in the figure and the change in / is 
the change in the area of under the two curves. We have 
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dH 



E 



dp 



dt' (4.151) 



where, in the second equality, we have neglected a contribution arising from the fact 
that the path around which we integrate changes as A changes. But this contribution 
can be safely ignored by the same argument given around (4.142). 
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We can get a simple expression for the product of partial derivatives by think- 
ing differentiating the Hamiltonian and remembering what depends on what. We 
have the expression H{q,p,X) = E where, in the left-hand side we substitute p = 
\/2my/ E[t) — V{q] \{t)). Then differentiating with respect to A, keeping E (and q) 
fixed, we have 



dH 



+ 



dH 
dp 



dp 
dX 



= 



So substituting this into (4.151) we have 
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dX^~~2^Jo 'dX 

So putting it all together, we have the time variation of / given by 

dX 
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(4.153) 



(4.154) 



where, in the first term, we've replaced E with the expression (4.146). Now we're 
almost done. So far, each term on the right-hand side is evaluated at a given time t or, 
correspondingly for a given X{t). The two terms look similar, but they don't cancel! 
But we have yet to make use of the fact that the change in A is slow. At this point 
we can clarify what we mean by this. The basic idea is that the speed at which the 
particle bounces backwards and forwards in the potential is much faster than the speed 
at which A changes. This means that the particle has performed many periods before 
it notices any appreciable change in the potential. This means that if we compute 
averaged quantities over a single period. 



{A{X)) = ^ I A{t,X)dt 



(4.155) 



then inside the integral we may treat A as if it is effectively constant. We now consider 
the time averaged motion (/) . Since A can be taken to be constant over a single period, 
the two terms in (4.154) do now cancel. We have 



(/)=0 



(4.156) 



This is the statement that / is an adiabatic invariant: for small changes in A, the 
averaged value of / remains constant^. 

^The proof given above is intuitive, but begins to creak at the seams when pushed. A nice descrip- 
tion of these issues, together with a more sophisticated proof using generating functions for canonical 
transformations is given in in the paper "The Adiabatic Invariance of the Action Variable in Classical 
Dynamics" by C.G.Wells and S.T.Siklos which can be found at http://arxiv.org/abs/physics/0610084. 



- 113 - 



The adiabatic invariants played an important role in the early history of quantum 
mechanics. You might recognise the quantity / as the object which takes integer values 
according to the old 1915 Bohr-Sommerfeld quantisation condition 



1 

2^ 



pdq^nh neZ (4.157) 



The idea that adiabatic invariants and quantum mechanics are related actually predates 
the Bohr-Somerfeld quantisation rule. In the 1911 Solvay conference Einstein answered 
a question of Lorentz: if the energy is quantised as = hnuj where n G Z then what 
happens if u is changed slowly? Lorentz' worry was that integers cannot change slowly 
- only by integer amounts. Einstein's answer was not to worry: E/uj remains constant. 
These days the idea of adiabatic invariants in quantum theory enters into the discussion 
of quantum computers. 

An Example: The Simple Harmonic Oscillator 

We saw in section 4.5 that for the simple harmonic oscillator we have / = E/uj. So 
if we change uj slowly, then the ratio E/uj remains constant. This was Einstein's 1911 
point. In fact, for the SHO it turns out that there is an exact invariant that remains 
constant no matter how quickly you change u and which, in the limit of slow change, 
goes over to /. This exact invariant is 



-5 



where g{t) is a function satisfying the differential equation 



(4.158) 



g + uj\t)g--^=Q (4.159) 
9^ 

4.6.1 Adiabatic Invariants and Liouville's Theorem 

There's a way to think of adiabatic invariants using Liouville's theorem. Consider first 
a series of systems, all described by a Hamiltonian with fixed parameter A. We set off 
each system with the same energy E or, equivalently, the same action 7, but we start 
them with slightly different phases 9. This means that their dynamics is described by 
a series of dots, all chasing each other around a fixed curve as shown in the figure. 
Now let's think about how this train of dots evolves under the Hamiltonian with time 
dependent A(t). Recall that Liouville's theorem states that the area of phase space is 
invariant under any Hamiltonian evolution. This holds whether or not dH/dt = 0, 
so is still valid for the time dependent Hamiltonian with X{t). One might be tempted 
to say that we're done since all the words sound right: Liouville's theorem implies 
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that the area is conserved which is also the statement that our adiabatic invariant / 
doesn't change with time. But this is a httle too fast! Liouville's theorem says the 
area of a distribution of particles in phase space is conserved, not the area enclosed by 
a perimeter ring of particles. Indeed, Liouville's theorem holds for any variation \{t), 
not just for adiabatic variations. For a fast change of X{t), there is nothing to ensure 
that the particles that had the same initial energy, but different phases, would have 
the same final energy and we lose the interpretation of a ring of dots in phase space 
enclosing some area. 

The missing ingredient is the "adiabatic principle' . 
In this context it states that for a suitably slow change of 
the parameter A, all the systems in the same orbit, with 
the same energy, are affected in the same manner. If this 
holds, after some time the dots in phase space will still 
be chasing each other around another curve of constant 
energy E'. We can now think of a distribution of parti- 
cles filling the area / inside the curve. As A varies slowly, 
the area doesn't change and the outer particles remain the Figure 67: 

outer particles, all with the same energy. Under these cir- 
cumstances, Liouville's theorem implies the adiabatic invariant / is constant in time. 

4.6.2 An Application: A Particle in a Magnetic Field 

We saw in Section 4.1 that a particle in a constant magnetic field B = (0, 0, B) makes 
circles with Larmor frequency u = eB/mc and a radius R, which depends on the energy 
of the particle. But what happens if B is slowly varying over space? i.e. B — B{x,y), 
but with 




diB < R 

so that the field is roughly constant over one orbit. 



(4.160) 



In this example, there is no explicit time dependence of the Hamiltonian so we know 
that the Hamiltonian itself is an exact constant of motion. For a particle in a constant 
magnetic field we can calculate H of an orbit by substituting the solutions (4.32) into 
the Hamiltonian (4.28). We find 



H = -mu^R'' 
2 



e^R^B^ 
2mc2 



(4.161) 



This quantity is conserved. But what happens to the particle? Does it drift to regions 
with larger magnetic field B, keeping H constant by reducing the radius of the orbit? 
Or to regions with smaller B with larger orbits? 
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We can answer this by means of the adiabatic invariant. We can use this because the 
motion of the particle is periodic in space so the particle sees a magnetic field which 
varies slowly over time. The adiabatic invariant is 

I^l-jpdq (4.162) 

which is now to be thought of as a path integral along the orbit of the electron. We 
evaluate this on the solution for the uniform magnetic field (4.32) 
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Setting uo = eB/mc, we see that the adiabatic invariant / is proportional to {e/c)BF?. 
Since the electric charge e and the speed of light c are not things we can change, we find 
that BE? is constant over many orbits. But as ~ B'^E? is also conserved, both the 
magnetic field B seen by the particle and the radius of the orbit R must be individually 
conserved. This means the particle can't move into regions of higher or lower magnetic 
fields: it must move along constant field lines^. 

Finally, there's a simple physical way to see that the particle indeed drifts along lines 
of constant magnetic field. As the particle makes its little Larmor circles, it feels a 
slightly stronger force when it's, say, at the top of its orbit where the field is shghtly 
larger, compared to when its at the bottom. This net force tends to push the particle 
to regions of weaker or stronger magnetic field. But we've seen through the use of 
adiabatic invariants that this isn't possible. The slow drift of the particle acts such 
that it compensates for this small force, keeping the particle on constant field lines. 

There's a slight variant of the above set-up which allows you to trap charged particles 
using magnetic fields. Consider the particle making its little Larmor circles in the (,t, y) 
plane, but also moving in the z direction and take a magnetic field that's constant in 
the {x, y)-plane, but ever increasing in the ^-direction. The energy of the particle is 
given by, 

1 .2 e^R^B'^ 1 .2 (-I)emB 

H^-mz^^ = -mz^ + ^ ^ 4.164 

2 2mc 2 c ^ ^ 



^For results that go beyond the adiabatic approximations, see the paper by the man: E. Witten 
"v4 Slowly Moving Particle in a Two-Dimensional Magnetic Field\ Annals of Physics 120 72 (1979). 
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Both H > and (— /) > are constant in time. This ensures that there exists a value 
of the magnetic field i? > at which wc necessarily have i = and the particle can go 
no further. At this stage it turns round and goes back again. By creating a magnetic 
field that increases at two ends, charged particles can be made to bounce back and 
forth in the z direction, while executing their little circles in the {x, y)-plane. It is this 
mechanism that traps charged particles in magnetic loops emitted from the sun and is 
ultimately responsible for solar fiares. 

4.6.3 Hannay's Angle 

Consider a particle bouncing around, with periodic motion, in an arbitrary potential. 
There are many parameters Xa describing the shape of the potential. As we slowly vary 
the Afl the path in phase space changes although, as we have seen, the area enclosed 
by the path remains the same. After some time tiong (which, by the assumption of 
adiabiticity, is much longer than the period T of a single orbit) we return to the original 
parameters so that Aa(tiong) = Aa(0). The question we want to ask is: how has the 
phase angle 9 changed? 

For any fixed Xa, the velocity of the angle variable is ^ = dH/dl = Lj{I,Xa)- As 
we slowly vary the parameters, the particle is spinning around its phase space orbits. 
When we return we therefore expect that the phase has been shifted by J udt. Which 
is true. But it turns out that there is another, more subtle, contribution to the phase 
shift as well. We'll now see where this comes from. 

As the parameters change, we can write the change in the angle 9 as 

Which looks simple enough. But there's a problem. The second term is not well defined. 

For each set of parameters Xa we have different action angle variables I{Xa) and 9{Xa)- 
But there's nothing that stops choosing a different origin ^ = for each choice of the 
parameters. In other words, we could always redefine 

9{Xa) ^ 9{Xa) + (3{Xa) (4.166) 

where we shift by a different constant /5 for each Xa- What this means is that it doesn't 
really make any sense to compare the angle variable for different parameters Xa- This 
makes the second term — which tells us how the angle variable changes as we change 
the parameters — ambiguous. One might think this means that we can just ignore it. 
Or, more precisely, we could choose the shifts (3 so that the angle variables are defined 
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in such a way that the second term vanishes. But it turns out that this isn't possible. 
Let's see why. The point is that it does make sense to compare the angle variable for 
the same parameters Aa. After such a time tiong, we have 

0{tiong) =9{0)+ / udt + Ae (4. 167) 

Jo 

The term J cudt is the dynamic term that we anticipated above, arising from the fact 
that 9 is continually making orbits around the curve in phase space. It depends on the 
time iiong that we took to make the change. The other term that we call is more 
interesting. Prom (4.165) it is given by 




(4.168) 



where we've used the fact that the change in is adiabatic to replace the integrand 
with its average over one period of the orbit. The final expression above means an 
integration over the curve C that the system traces in parameter space. Wc sec that 
is independent of the time tiong taken to make the change. However, it docs depend 
on the path that we took through the space of all possible potentials. It is known as 
the "Hannay angle". Note that the Hannay angle is invariant under the ambiguity 
(4.167) even though the quantity d9/dXa that appears in the integrand isn't. This 
idea of integrating quantities around loops is an example of "holonomy", an important 
concept in modern physics. 

Rather surprisingly, the Hannay angle was first discovered only in 1984. The history 
is interesting. First Berry discovered a similar phase for the wavefunction in quantum 
mechanics (now known as the Berry phase). Many physicists were shocked that such 
a simple and profound idea as Berry's phase had lain hidden in the formulation of 
quantum mechanics for 50 years and it set off a flurry of theoretical and experimental 
research. Soon after this, Hannay showed that an analogous phase had lain undis- 
covered in classical mechanics for 150 years! Although, in certain examples in celestial 
mechanics, the phase A9 had been correctly calculated, the general theory lying behind 
it had not been appreciated. We now describe this theory. 

The flrst step is to use a higher dimensional version of Stokes' theorem to express 
the contour integral (4.168) as a surface integral 
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where S" is a surface in parameter space bounded by the curve C and dAab is the in- 
finitesimal surface element. 



Claim: The Hannay angle can be written as 



(4.170) 



where the anti-symmetric matrix Wab (known mathematically as a 2-form) is given by 

89 81 80 81 



W, 



ab 



8\a 8Xb 



8\h 8\n 



(4.171) 



Proof: To start with let's think about the averaging procedure a little more. In 
equation (4.155) we wrote (^4) as a time average, integrating over a single period. We 
could equally as well write it as an angle average, 

{A) = ^A{I,9)d9 = J A{q',p')S{I'-I)^ (4.172) 

where in the second equality we integrate over all of phase space and insert the delta 
function 6{I' — I) to restrict the integral to the orbit over the curve with action /. It's 
this latter formula that we'll use. This allows us to write, 
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(4.173) 



which completes the proof. I haven't included any examples here of the Hannay angle: 
some simple ones can be found in the original literature^ and more advanced applica- 
tions can be found by googling "Hannay angles". For the pendulum example, in which 
the length of the pendulum varies, the Hannay angle vanishes. This is because there is 
only one parameter to vary, while a non-trivial occurs only if we make a non-trivial 
loop C in parameter space. 

^J. Hannay, "Angle Variable Holonomy in Adiahatic Excursion of an Integrable Hamiltonian" J. 
Phys A, 18 221 (1985); M. Berry, "Classical Adiahatic Angles and Quantal Adiahatic Phase" J. Phys 
A, 18 15 (1985). 
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4.7 The Hamilton-Jacobi Equation 

In this section we will describe yet another viewpoint on 
classical dynamics, known as Hamilton-Jacobi theory. It will 
tie together several concepts that we've met so far. Recall 
from section 2.1 the principle of least action. We define the 
action 




= / L{Qi,Qi,t) 
Jo 



dt 



(4.174) 



Figure 68: 



which we evaluate for all paths q{t) with fixed end points 
Then the true path taken is an extremum of the action: 5S — 0. 



(4.175) 



Now let's change perspective a little. Consider the action evaluated only along the 
true path g^iassicai^^^ define 



classical / 



(4.176) 



While 5" is a functional on any path, W is to be considered as a function of the initial 
and final configurations gr?"*'^i and as well as the time T it takes to get between 
them. 

Now let's ask what happens if we keep g™'*'^' fixed but vary the end point qf^^^. We 
can go back to the analysis of section 2.1 to see that when the action is varied it looks 
like 
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If we evaluate this on the classical path the first term vanishes. We're left with 
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(4.178) 



t=T 



The next thing we want to compute is dW/dT. Let's start by considering a classical 
path with fixed initial configuration q^J^^^^^^. We'll let the path run on a httle longer 
than before, soT ^T + 5T. Then we have 
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(4.179) 
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But this total derivative is easily calculated since dS/dT = L, or 

^ = L(gf ™i(T), qf^^^c-\T),T) = L{<i^^\ qf^^\ T) (4.180) 



So we arrive at the equation, 

dW 



/^final -final r /^final -final rp\\ 

— - [Pi Qi - HQi ' ?i ' J j 



dT 

^ -H{qf-^\l^-^\T) (4.181) 

At this stage, the only time in the game is T and the only position in the game is gf"'^^ 
So we can simply drop the word "final", and relabel T — > t. We have found ourselves 
a time dependent function on configuration space W — W{qi,t) which satisfies 

dW dW 

-Q^=Pi and ^ -H{qi,pi,t) (4.182) 

or, substituting the first into the second, we have 

dW 

-^ = -H{qi,dW/dqi,t) (4.183) 

This is the Hamilton- Jacobi Equation. 

We've shown how that a solution to the Hamilton- Jacobi equation can be constructed 
by looking at the classical action of paths which reach a point q^ at time T starting 
from some initial reference point g^"'*'^'. The starting point g^"^'*'^' can be considered 
integration constants. In fact, there are more general solutions to the Hamilton-Jacobi 
equation, although all are related to the classical action in a similar way. 

Suppose we find a solution to (4.183). What do we do with it? We're now armed 
with some time-dependent function W{qi,t) on configuration space. We combine this 
with the first of Hamilton's equations which reads 



OH 



(4.184) 

Pi=aW/dqi 



where, on the right-hand- side, we've replaced every appearance of the momenta Pi by 
a function of the coordinates using pi = dW/dqi. What we're left with is n first-order 
differential equations for the evolution of q^. In this manner the function W determines 
the path of the classical system: start it ofi^ at a point in configuration space and W 
can be considered as a real valued classical wavefunction which tells it how to evolve. 
What we need to show is that the evolution dictated by (4.184) does indeed satisfy the 
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equations of motion. In other words, we should prove that the second of Hamilton's 
equations, pi = —dH/dqi, is satisfied. We have 



d (dW 



dt \ dqi 



dqidqj dtdqi 



(4.185) 



But differentiating the Hamilton-Jacobi equation (4.183) with respect to g^, we see that 
we can rewrite the right-hand-side of this equation using 



dH dH d'^W 



dtdqi dqi dpj dqidqj dq. 
So that (4.185) becomes Pi = —dH/dqi as required. 
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(4.186) 



Let's see what we've done. We're used to dealing with second order differential equa- 
tions for the time evolution on configuration space (i.e. Lagrange's equations) and 
first order differential equations for time evolution on phase space (Hamilton's equa- 
tions). But the Hamilton-Jacobi approach allows us to incorporate n of the integration 
constants in the function W{qi, t) so that we're left solely with first order differential 
equations on configuration space given by (4.184). 

When we have conservation of the Hamiltonian, so dH/dt = 0, there is solution of 
the Hamilton-Jacobi equation of a particularly simple form. Define 



Wiqi,t)^W'iqi)-Et 



(4.187) 



for some constant E. Then the time dependence drops out and we get the equation 

H{qi,dWydqi)^E (4.188) 

is known as Hamilton's principal function. The spe- , ^ \ 

cial property of this solution to the Hamilton-Jacobi 
equation is that every path in configuration space de- 
termined by the function Wq has the same energy E. r 
With a httle thought, we can envisage how to construct 
solutions to (4.188). Start with a co-dimension one sur- 
face in configuration space which we will specify to be a 
surface of constant Wq. (Co-dimension one means that Figure 69: 

the surface has dimension (n — 1): it splits the configura- 
tion space in two). At any point in this surface, the potential energy V{q) is determined. 
Since Pi = dWo/dqi, the momentum is perpendicular to the surface and in the direc- 
tion of increasing Wq. Its magnitude is fixed by requiring that the total energy is E. 



level surfaces of W 
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But this magnitude then tells us the position of the next surface of constant Wq (with 
incremental increase). In this manner, it should be clear that, in multi-dimensional 
configuration spaces there are many solutions to (4.188). It should also be clear that 
something singular happens to Wq in regions where V{qi) — 0. Finally, we note that 
even when dH/dt — 0, there exist other solutions W to (4.183) which encode famihes 
of trajectories on configuration space which have different energies. 

4.7.1 Action and Angles from Hamilton- Jacobi 

For the majority of examples the Hamilton- Jacobi approach doesn't give a particularly 
useful way for solving a problem; its utility really lies in the structure it reveals about 
classical dynamics. So rather than go through the gymnastics of solving a complicated 
problem using this method, let us focus on a rather simple example which which il- 
lustrates connections between the different ideas we've seen. A system with a single 
degree of freedom has Hamiltonian 

H-^ + ViQ) (4-189) 

Since the Hamiltonian is time independent, energy is conserved and the solution to 
the Hamilton- Jacobi equation has a single integration constant, let's call it /3, which 
is necessarily some function of the energy. In the above discussion we were a little lax 
about showing these integration constants explicitly, but let's do it now: we'll write 
W = W{q, t; 13) with (3 = (3{E). Now we ask a somewhat strange question: suppose we 
try to perform a canonical transformation from (g,p) to new coordinates {a, (3) such 
that P is the new momentum. What is the new coordinate a? 

Since we wish the change of coordinates to be canonical, we must be able to write 
q = q{a, (3) and p = p{a, (3) such that 

_ dq dp dq dp 
{^,pW) = ^^-^^-1 (4.190) 

Using p = dW/dq and remembering what all depends on what all {W = W{q,(3) and 
q = q{a, j3) and p = p{a, (3)) we can write this as, 

_ dq r O^W d^W dq\ dqd^W dq _ dq d rdW\ 
{Q,P}ia,p) - ^Qf^dq + 'd^djj ) ~ df3~d^d^ ~ d^Wq \ dp ) ^' ^ 

and we find that the transformation is canonical if we take a = dW/ df3. Note the nice 
symmetry here: we have a solution W{q, t; j3) to the Hamilton Jacobi equation and we 
can think in terms of canonical coordinates {q.,p) or alternatively (a,/?) where 

dW 
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The function W is an example of a generating function of the second kind (4.120). 

So what to do with this? Let's look at some examples. Take (3 — E, so that our new 
momentum variable is the energy itself. What is the canonical coordinate? If we write 
W{q, t; E) — VFo(?, E) — Et then the coordinate canonically dual to E is 

dWn 

a^^{q,E)-t (4.193) 

Taking the time dependence over the left-hand-side, we see that a has the interpretation 
of —to, the initial starting time. This tells us that we may paramctcrise every trajectory 
in a one-dimensional system in terms of the energy and starting time of the path, and 
that these are canonical variables. Again we see the dual relationship between energy 
and time. Note that both E and to are independent of time; we've found canonical 
variables for which neither the coordinate nor the momentum vary along the path. 

As another example consider the case of /5 = /, our action variable of Section 4.5. 
What is the canonical coordinate a in this case? We expect that it will be related to 
the angle variable 9. To see this, we use the fact that W is the classical action to write 



Wo 

So we have that 



J Ldt + Et^ J{L + H)dt^ J qpdt^ J pdq (4.194) 



dW d f , dE ^ 
--^-Jfjpdq--t^e-.t (4.195) 

where we've used our expression (4.144) for the angle variable, as well as the equation 
(4.141) for the frequency of motion u. So we see that a is not quite equal to 9, 
but is shifted by a term linear in time. In fact this means that a itself docs not 
change in time. Once again, we've arrived at a way to parameterise the orbits of 
motion by canonical variables which do not themselves change with time. In fact, in 
most presentations, this is the starting motivation for the Hamilton- Jacobi approach 
to classical dynamics and, even for higher dimensional systems, the function W can 
be thought of as a way to generate new, time independent, canonical variables. More 
discussion on the relationship between canonical transformations, angle-action variables 
and the Hamilton- Jacobi formulation can be found in the book by Hand and Finch, or 
deep within Goldstein. 

4.8 Quantum Mechanics 

One of the primary reasons for studying the rather formal aspects of classical mechanics 
discussed in this course is to make contact with quantum mechanics. For this reason. 
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in this last section of the course we will illustrate the connection between the classical 
and quantum world and point out a few analogies that might just make the quantum 
behaviour look a little less weird. (Just a little less: after all, it really is weird!) 

In classical mechanics the state of a system is described by a point {qi,Pi) in phase 
space. In quantum mechanics the state is described by a very different object: a 
complex valued wavefunction ip{q) over the configuration space. The observables are 
operators on the space of wavef unctions. The standard representations of the position 
operator qi and momentum operator pi are 

pMq) = (4-196) 
oqi 

which leads to the well known Heisenberg commutation relations 

[qi, qj] = 

[qi,qj]^0 (4.197) 
[Qi,Pj] = ifi^ij 

Of course, we've already seen something very familiar in section 4.3 on Poisson brackets 
as summarised in equation (4.61). Although the bilinear, antisymmetric operators 
[ , ] and { , } act on very different spaces, they carry the same algebraic structure. 
Heuristically the relations (4.61) and (4.197) both represent the mathematical fact 
that momentum Pi generates infinitesimal translations of qi'. in classical mechanics we 
saw this in when we studied infinitesimal canonical transformations in section 4.4.1; in 
quantum mechanics it follows from the representation (4.196) and Taylor's expansion. 

In general the map between a classical system and a quantum system goes via the 
Poisson brackets and was formulated by Dirac: 

{ ) ^classical ^ ' ■ll'^^^*"'^ (4.198) 

This prescription for going between the classical and quantum theories is known as 
canonical quantisation. It also gives rise to the quantum equations of motion. In the 
Poisson bracket language, we have seen that the classical equation of motion for an 
arbitrary function f{q,p) is 

/ = {/,//} ihf^[f,H] (4.199) 

which is the equation of motion in the Heisenberg picture, in which the time dependence 
is assigned to the operator rather than the state. 
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While a great physicist, Dirac was never much of a storyteller. It shows in 
following anecdote recounting his graduate student days: 

"/ went hack to Cambridge at the beginning of October 1925, and resumed 
my previous style of life, intense thinking about these problems during the 
week and relaxing on Sunday, going for a long walk in the country alone. 
The main purpose of these long walks was to have a rest so that I would 
start refreshed on the following Monday. 

It was during one of the Sunday walks in October 1925, when I was think- 
ing about this {uv — vu), in spite of my intention to relax, that I thought 
about Poisson brackets. I remembered something which I had read up previ- 
ously, and from what I could remember, there seemed to he a close similarity 
between a Poisson bracket of two quantities and the commutator. The idea 
came in a flash, I suppose, and provided of course some excitement, and 
then came the reaction "No, this is probably wrong". 

I did not remember very well the precise formula for a Poisson bracket, 
and only had some vague recollections. But there were exciting possibilities 
there, and I thought that I might he getting to some big idea. It was really 
a very disturbing situation, and it became imperative for me to brush up on 
my knowledge of Poisson brackets. Of course, I could not do that when I 
was right out in the countryside. I just had to hurry home and see what I 
could find about Poisson brackets. 

I looked through my lecture notes, the notes that I had taken at various 

lectures, and there was no reference there anywhere to Poisson brackets. 
The textbooks which I had at home were all too elementary to mention them. 
There was nothing I could do, because it was Sunday evening then and the 
libraries were all closed. I just had to wait impatiently through that night 
without knowing whether this idea was really any good or not, but I still 
think that my confidence gradually grew during the course of the night. 

The next morning I hurried along to one of the libraries as soon as it 
was open, and then I looked up Poisson brackets in Whitackers Analytical 
Dynamics, and I found that they were just what I needed." 
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4.8.1 Hamilton, Jacobi, Schrodinger and Feynman 

While the Poisson bracket structure of quantum mechanics dovetails nicely with Heisen- 
berg's approach, the Hamilton- Jacobi equation is closely tied to Schrodinger's wave 
equation. Let's first recall what Schrodinger's equation looks like for a one- dimensional 
system with a Hamiltonian operator H = /2m + V{q) acting on wavefunctions V'(g), 

where wc have used the representation (4.196) for the position and momentum opera- 
tors. To sec the relationship of this equation to classical dynamics we decompose the 
wavefunction into the modulus and phase, 

iPiq, t) = R{q, t) e^^^i'ty (4.201) 



where R and W are both real functions. We know that R is related to the probability 
P for finding a particle at position q at time t: P{q, t) = \ip{q, t)p = R{q, t)^- But what 
is the interpretation of the phase W7 Let's substitute this decomposition of ip into the 
Schrodinger equation to find 



ih 



dR iR dW 



'2m 



d^R 2i dRdW 
dq^ h dq dq 



R 



^ iR d^W 
+ 



h dq^ 



+ VR 



At this stage, we take the classical limit h 
situation with 



0. Or, more precisely, we consider a 



dq^ 



dW 



dq 



(4.202) 



which can be understood physically as the requirement that the de Broglie wavelength 
of the particle is much smaller than any other length scale around. Either way, collecting 
together the terms above to leading order in h we find 



dW J_ 



dW 
dq 



+ V{q) = 0{h) 



(4.203) 



which we recognise as the Hamilton- Jacobi equation (4.183). So in the classical limit 
the phase of the wavefunction is understood as the classical action of the path taken 
by the particle. 
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Finally, let us finish on the same topic that we started: the principle of least action. 
Recall from section 2.1 that we can determine the true path of a system by assigning a 
number, called the action S, to every possible path. The equations of motion are then 
equivalent to insisting that the true path is an extremum of S. But what about all the 
other paths? Do they play any role in Nature? The answer is that, in the quantum 
world, they do. Suppose a particle is observed to be at position qi at time t — 0. Then 
the probability P that it will later be observed to be at position qf at time t — T is 
encapsulated in the wavefunction ip{qf, T). The Feynman path integral formula for the 
wavefunction is 

T)^N r Vq{t) e^^l^Wl/'^ (4.204) 

The here is just a normalisation constant to ensure that probabilities add up to one: 
i.e. j \ilj{q)\^dq = 1. The tricky part of this formula is the integral: it is a sum over all 
possible paths from q = q^ at time to g = g/ at time T. These paths are weighted 
with their action. It's as if the particle really does take every possible path, but with 
a particular phase. In the hmit /i — > 0, the phases oscillate wildly for any path away 
from the classical equation of motion SS — and they cancel out in the integral. But 
for situations where H is important, the other paths are also important. 

Let's prove that the wavefunction defined by (4.204) satisfies the Schrodinger equa- 
tion. Firstly we need to understand this integral over paths a little better. We do this 
by splitting the path into n small segments, each ranging over a small time St — t/n. 
Then we define 

/ Vqit) = lim n ^ (4.205) 

where qk is the position of the particle at time t = k6t. In this expression C is a 
constant that we're going to figure out shortly that will be required to make sense of 
this infinite number of integrals. In any given segment, we treat the path as straight 
lines as shown in the figure and replace the action with the appropriate quantity, 

s^j\t (i„,^-n,))^|:(f(^-«.(^ 

Then to prove that ip defined in (4.204) satisfies Schrodinger's equation, let's consider 
adding a single extra time step to look at the wavefunction at time t + St. We can 
Taylor expand the left hand side of (4.204) happily 

^{qf, T + St)^ ^{qf, T) + ^St + 0{St') (4.206) 
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Figure 70: Discretising the paths. 



while the right hand side requires us to do one extra integral over the penultimate posi- 
tion of the path q' . But the first n integrals simply give back the original wavefunction, 
now evaluated at q' . We get 



im {qf - q'f _ i ( qj + q' \ 

2h 5t h^^^y 2 / 



The term in the exponent means that the integral oscillates wildly whenever q' is far 
from qf and these regions of the integral will all cancel out. We can therefore Taylor 
expand around {qf — q') to rewrite this as 



f+°° dq^ 

J-oo ~C 



exp 



im {qf — q'Y 

m 6i 



iSt 



1 - '-^V{qf] 



At this stage we do the integral over q' . We'll use the formulae for Gaussian integration 



dye-"' = r- 



j dy ye-''' = , J dy y^-"" = (4.209) 



a 



Then equating the lefthand side (4.206) with the righthand side (4.208), we have 



At this stage we see what the constant C has to be to make sense of this whole calcu- 
lation: we should take 



[2Mt 
—im 



(4.210) 
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so that C — as (5t — 0. Then the terms of order 0{6t^) agree. Collecting the terms 
of order 0{6t), and replacing the time T at the end point with the general time t, we 
see that we have 

and we recover Schrodinger's equation as promised. 
4.8.2 Nambu Brackets 

Throughout this section, we've seen that several of the structures appearing in quantum 
mechanics were anticipated, in some form, within the framework of classical dynamics. 
You just need to know where to look. One might wonder whether classical mechanics 
also contains other structures which will prove to be important in future discoveries. 
Or, alternatively, whether there are ways to extend the framework of classical dynamics 
that hints at new ways to formulate the laws of physics. In this section, I'll briefly 
describe such an extension due to Nambu in 1973. I should confess immediately that 
there's no known use for Nambu's formalism! And it's not at all clear that one will be 
found! But then again, maybe it holds the clue that will prove crucial in the search for 
the ideas beyond the known laws of Nature. 

We've seen that the Hamiltonian framework deals with canonical pairs of coordinates 
and momenta {qi,Pi) with i = 1, . . . ,n. Nambu's idea was to extend this to triplets of 
objects {qi,Pi,ri) with i — 1, . . . ,n. We don't say what this extra variable is: just 
that it is necessary to define the state of a system. This means that the phase space 
has dimension 3n. The Nambu bracket, which replaces the Poisson bracket, acts on 
three functions /, g and h in phase space, 

' ^' ^ dqi dpi dr-i dqi dvi dpi dri dqi dpi 

_ a/ 9/i ^ 9/ 9^ a/i _ a/ a/i 9/i (4212) 

dri dpi dqi dpi dri dqi dpi dqi dvi 
This satisfies similar properties to the Poisson bracket, including linearity and 

• Anti-symmetry: {/, g, h} = -{^f, /, h} = {g, h, /}. 

• Leibniz: {fg, h, 1} = f{g, h, 1} + {/, h, l}g. 

• "Jacobi" : {{/, g, h}, I, m} + {h, {/, g, /}, m} + {h, I, {/, g, m}} = {/, g, {h, I, m}}. 
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In order to specify time evolution, we need two " Hamiltonians" . We call them H{q,p, r) 
and G{q,p,r). Then any function / over phase space evolves as 

f = {/, G, H} (4.213) 
In particular, the new version Hamilton's equations read 
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dpi dqi 



(4.214) 



where there's no sum over i on the right-hand side of these equations. By the anti- 
symmetry of the Nambu bracket, wc learn that both H and G are conserved (as long 
as neither have explicit time dependence). 

Many of the key features of classical dynamics are retained in Nambu's formalism. 
For example, Liouville's theorem still holds. (This was Nambu's original motivation 
for suggesting this framework). Similarly, canonical transformations can be defined as 
a change of variables qi — > Qi{q,p, r) and Pi — >• Pi{q,p, r) and rj — >• Ri{q,p, r) such that 
the Nambu bracket structure is preserved, for example 

{Qi,Pj,Rk}=\ ^ '^' = ^ = ^ (4.215) 
[ otherwise 

together with similar equations involving other combinations of Q's, P's and i?'s. 
"Hamilton's" equations (4.214) are invariant under these canonical transformations. 

The Nambu bracket provides a generalisation of classical dynamics. But can we 

quantise it? In other words, can wc find some operators which reproduce the Nambu 
bracket structure (up to a factor of h and perhaps an i) in much the same way that the 
usual quantum theory ties in with the Poisson bracket? This turns out to be pretty 
tricky. In particular, it seems difficult to keep all three conditions: anti-symmetry, 
Leibniz and Jacobi. Perhaps this suggests that the correct mathematical structure has 
not yet been uncovered. Perhaps it suggests that the Nambu bracket is just not useful! 

Chances are that you won't ever have any use for the Nambu bracket. But you never 
know. Perhaps one day, like Dirac, you'll return from wandering the fields around 
Cambridge and desperately need to recall this concept. But, unlike Dirac, you'll be 
able to a find reference in your lecture notes (the notes you had taken at various 
lectures) . 
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